A Design and Development of Word Sense Disambiguation Algorithm for English Language Understanding for Database Access

Authors

  • M. Munusamy
  • G. Tholkappia Arasu
  • V. Palanisamy
  • S. Selvarajan
Abstract

Problem statement: This study presents an object-net method for word sense disambiguation. It proposes to model the elementary meanings that help a machine autonomously carry out the analysis and synthesis of meaning. Approach: In the proposed methodology, disambiguation was performed in a context-dependent manner. Starting from natural text, the context of the sentence was identified first; the actual meaning was then identified by correlating it with the elementary object meanings stored in the object-net database. This works because even an ambiguous word has only one meaning given the context, object, or domain in which the sentence was written. Results: The object-net approach disambiguates original text with a high precision of 96% for verbs and 97% for nouns, for data extraction from the database and reporting in the form of graphs. Conclusion: The accuracy of finding the sense of a word, extracting data from the database, and projecting it into graphs depended on the number of trained objects in the object-net database. The object-net database therefore plays a major role in the proposed method.
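
The two-step approach described in the abstract (identify the context of the sentence, then pick the meaning stored for that context) can be illustrated with a minimal Python sketch. The abstract does not specify how the object-net database is structured or how the correlation is computed, so everything here is an assumption made for illustration: the `OBJECT_NET` dictionary, its keywords and senses, the keyword-overlap score, and the function names `identify_context` and `disambiguate` are all hypothetical and are not the authors' implementation.

```python
from collections import Counter

# Hypothetical toy "object-net": each object (domain) holds a set of
# elementary meanings (keywords) and the sense it assigns to ambiguous words.
# The real object-net database is not described in the abstract.
OBJECT_NET = {
    "finance": {
        "keywords": {"account", "loan", "deposit", "interest", "money"},
        "senses": {"bank": "financial institution"},
    },
    "geography": {
        "keywords": {"river", "water", "shore", "flood", "fishing"},
        "senses": {"bank": "sloping land beside a river"},
    },
}


def identify_context(tokens):
    """Return the object whose keywords overlap most with the tokens.

    Keyword overlap is a stand-in for the correlation step; returns None
    when no object shares any keyword with the sentence.
    """
    scores = Counter()
    for name, obj in OBJECT_NET.items():
        scores[name] = sum(1 for t in tokens if t in obj["keywords"])
    best, score = scores.most_common(1)[0]
    return best if score > 0 else None


def disambiguate(sentence, word):
    """Pick the sense of `word` stored under the sentence's context."""
    tokens = [t.strip(".,;:!?").lower() for t in sentence.split()]
    # The ambiguous word itself is excluded from the context evidence.
    context = identify_context([t for t in tokens if t != word])
    if context is None:
        return None
    return OBJECT_NET[context]["senses"].get(word)


print(disambiguate("She opened a deposit account at the bank", "bank"))
print(disambiguate("They went fishing on the river bank", "bank"))
```

In this sketch the quality of the result depends entirely on how many objects and keywords the toy dictionary contains, which mirrors the abstract's conclusion that accuracy depends on the number of trained objects in the object-net database.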

Similar resources

Design and implementation of Persian spelling detection and correction system based on Semantic

The Persian language has special features (grapheme, homophone, and multi-shape clinging characters) on electronic devices. Furthermore, designing and implementing NLP tools for Persian is more challenging than for other languages (e.g. English or German). Spelling tools are widely used for editing user texts such as emails and text in editors. Also developing Persian tools will provide Persian progr...

Sense Extraction and Disambiguation for Chinese Words from Bilingual Terminology Bank

Using lexical semantic knowledge to solve natural language processing problems has become popular in recent years. Because semantic processing relies heavily on lexical semantic knowledge, the construction of lexical semantic databases has become urgent. WordNet is the most famous English semantic knowledge database at present; many studies of word sense disambiguation adopt it as a st...

Automatic Construction of Persian ICT WordNet using Princeton WordNet

WordNet is a large lexical database of the English language in which nouns, verbs, adjectives, and adverbs are grouped into sets of cognitive synonyms (synsets). Each synset expresses a distinct concept. Synsets are interlinked by both semantic and lexical relations. WordNet is essentially used for word sense disambiguation, information retrieval, and text translation. In this paper, we propose s...

Material Development and English for Academic Purposes Word Lists; a Reductionist Approach

Nagy (1988) states that vocabulary is a prerequisite factor in comprehension. Drawing upon a reductionist approach and having in mind the prospects for material development, this study aimed at creating an English for Academic Purposes Word List (EAPWL). The corpus of this study was compiled from a corpus containing 6479 pages of texts, 2,081,678 tokens (running words) and 63825 types (...

A monolingual semantic decoder based on word sense disambiguation for mixed language understanding

In this paper, a new method for spoken mixed language understanding is presented. By mixed language, we mean that the words included in one sentence may come from different languages: a primary language and a secondary language. In conventional statistical semantic decoders, the conceptual structure is represented as a hidden Markov model; the decoding of the conceptual content of a sentence is...

Publication date: 2011